40 research outputs found
Phylogenetic Analysis of Cell Types using Histone Modifications
In cell differentiation, a cell of a less specialized type becomes one of a
more specialized type, even though all cells have the same genome.
Transcription factors and epigenetic marks like histone modifications can play
a significant role in the differentiation process. In this paper, we present a
simple analysis of cell types and differentiation paths using phylogenetic
inference based on ChIP-Seq histone modification data. We propose new data
representation techniques and new distance measures for ChIP-Seq data and use
these together with standard phylogenetic inference methods to build
biologically meaningful trees that indicate how diverse types of cells are
related. We demonstrate our approach on H3K4me3 and H3K27me3 data for 37 and 13
types of cells respectively, using the dataset to explore various issues
surrounding replicate data, variability between cells of the same type, and
robustness. The promising results we obtain point the way to a new approach to
the study of cell differentiation.Comment: Peer-reviewed and presented as part of the 13th Workshop on
Algorithms in Bioinformatics (WABI2013
Probabilistic partitioning methods to find significant patterns in ChIP-Seq data
Motivation: We have witnessed an enormous increase in ChIP-Seq data for histone modifications in the past few years. Discovering significant patterns in these data is an important problem for understanding biological mechanisms. Results: We propose probabilistic partitioning methods to discover significant patterns in ChIP-Seq data. Our methods take into account signal magnitude, shape, strand orientation and shifts. We compare our methods with some current methods and demonstrate significant improvements, especially with sparse data. Besides pattern discovery and classification, probabilistic partitioning can serve other purposes in ChIP-Seq data analysis. Specifically, we exemplify its merits in the context of peak finding and partitioning of nucleosome positioning patterns in human promoters. Availability and implementation: The software and code are available in the supplementary material. Contact: [email protected] Supplementary information: Supplementary data are available at Bioinformatics onlin
Probabilistic partitioning methods to find significant patterns in ChIP-Seq data
Motivation: We have witnessed an enormous increase in ChIP-Seq data for histone modifications in the past few years. Discovering significant patterns in these data is an important problem for understanding biological mechanisms. Results: We propose probabilistic partitioning methods to discover significant patterns in ChIP-Seq data. Our methods take into account signal magnitude, shape, strand orientation and shifts. We compare our methods with some current methods and demonstrate significant improvements, especially with sparse data. Besides pattern discovery and classification, probabilistic partitioning can serve other purposes in ChIP-Seq data analysis. Specifically, we exemplify its merits in the context of peak finding and partitioning of nucleosome positioning patterns in human promoters
Study of cell differentiation by phylogenetic analysis using histone modification data
Background: In cell differentiation, a cell of a less specialized type becomes one of a more specialized type, even though all cells have the same genome. Transcription factors and epigenetic marks like histone modifications can play a significant role in the differentiation process.Results: In this paper, we present a simple analysis of cell types and differentiation paths using phylogenetic inference based on ChIP-Seq histone modification data. We precisely defined the notion of cell-type trees and provided a procedure of building such trees. We propose new data representation techniques and distance measures for ChIP-Seq data and use these together with standard phylogenetic inference methods to build biologically meaningful cell-type trees that indicate how diverse types of cells are related. We demonstrate our approach on various kinds of histone modifications for various cell types, also using the datasets to explore various issues surrounding replicate data, variability between cells of the same type, and robustness. We use the results to get some interesting biological findings like important patterns of histone modification changes during cell differentiation process.Conclusions: We introduced and studied the novel problem of inferring cell type trees from histone modification data. The promising results we obtain point the way to a new approach to the study of cell differentiation. We also discuss how cell-type trees can be used to study the evolution of cell types
Genome-Wide Evaluation of Histone Methylation Changes Associated with Leaf Senescence in Arabidopsis
Leaf senescence is the orderly dismantling of older tissue that allows recycling of nutrients to developing portions of the plant and is accompanied by major changes in gene expression. Histone modifications correlate to levels of gene expression, and this study utilizes ChIP-seq to classify activating H3K4me3 and silencing H3K27me3 marks on a genome-wide scale for soil-grown mature and naturally senescent Arabidopsis leaves. ChIPnorm was used to normalize data sets and identify genomic regions with significant differences in the two histone methylation patterns, and the differences were correlated to changes in gene expression. Genes that showed an increase in the H3K4me3 mark in older leaves were senescence up-regulated, while genes that showed a decrease in the H3K4me3 mark in the older leaves were senescence down-regulated. For the H3K27me3 modification, genes that lost the H3K27me3 mark in older tissue were senescence up-regulated. Only a small number of genes gained the H3K27me3 mark, and these were senescence down-regulated. Approximately 50% of senescence up-regulated genes lacked the H3K4me3 mark in both mature and senescent leaf tissue. Two of these genes, SAG12 and At1g73220, display strong senescence up-regulation without the activating H3K4me3 histone modification. This study provides an initial epigenetic framework for the developmental transition into senescence
Multi Pattern Dynamic Time Warping For Automatic Speech Recognition
We are addressing the problem of jointly using multiple noisy speech patterns for automatic speech recognition (ASR), given that they come from the same class. If the user utters a word K times, the ASR system should try to use the information content in all the K patterns of the word simultaneously and improve its speech recognition accuracy compared to that of the single pattern based speech recognition. T address this problem, recently we proposed a Multi Pattern Dynamic Time Warping (MPDTW) algorithm to align the K patterns by finding the least distortion path between them. A Constrained Multi Pattern Viterbi algorithm was used on this aligned path for isolated word recognition (IWR). In this paper, we explore the possibility of using only the MPDTW algorithm for IWR. We also study the properties of the MPDTW algorithm. We show that using only 2 noisy test patterns (10 percent burst noise at -5 dB SNR) reduces the noisy speech recognition error rate by 37.66 percent when compared to the single pattern recognition using the Dynamic Time Warping algorithm
Joint evaluation of multiple speech patterns for speech recognition and training
We are addressing the novel problem of jointly evaluating multiple speech patterns for automatic speech recognition and training. We propose solutions based on both the non-parametric dynamic time warping (DTW) algorithm, and the parametric hidden Markov model (HMM). We show that a hybrid approach is quite effective for the application of noisy speech recognition. We extend the concept to HMM training wherein some patterns may be noisy or distorted. Utilizing the concept of ``virtual pattern'' developed for joint evaluation, we propose selective iterative training of HMMs. Evaluating these algorithms for burst/transient noisy speech and isolated word recognition, significant improvement in recognition accuracy is obtained using the new algorithms over those which do not utilize the joint evaluation strategy
Joint decoding of multiple speech patterns for robust speech recognition
We are addressing a new problem of improving automatic speech recognition performance, given multiple utterances of patterns from the same class. We have formulated the problem of jointly decoding K multiple patterns given a single Hidden Markov Model. It is shown that such a solution is possible by aligning the K patterns using the proposed Multi Pattern Dynamic Time Warping algorithm followed by the Constrained Multi Pattern Viterbi Algorithm The new formulation is tested in the context of speaker independent isolated word recognition for both clean and noisy patterns. When 10 percent of speech is affected by a burst noise at -5 dB Signal to Noise Ratio (local), it is shown that joint decoding using only two noisy patterns reduces the noisy speech recognition error rate to about 51 percent, when compared to the single pattern decoding using the Viterbi Algorithm. In contrast a simple maximization of individual pattern likelihoods, provides only about 7 percent reduction in error rate